Breast cancer is one of the leading causes of cancer death among women. As the primary output of breast screening, breast ultrasound (US) videos contain exclusive dynamic information for cancer diagnosis. However, training models for video analysis is non-trivial, since it requires a voluminous dataset and annotation is expensive. Furthermore, the diagnosis of breast lesions faces unique challenges such as inter-class similarity and intra-class variation. In this paper, we propose a pioneering approach that directly exploits videos in computer-aided breast cancer diagnosis. It leverages masked video modeling as pretraining to reduce reliance on dataset size and detailed annotations. Moreover, a correlation-aware contrastive loss is developed to facilitate the identification of the internal and external relationships between benign and malignant lesions. Experimental results show that our proposed approach achieves promising classification performance and can outperform other state-of-the-art methods.
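As an illustration of the correlation-aware contrastive objective described above, here is a minimal PyTorch sketch, assuming a standard supervised-contrastive formulation over video-level embeddings with benign/malignant labels; the temperature and the pairing scheme are our assumptions, not the authors' released implementation.

```python
import torch
import torch.nn.functional as F

def correlation_contrastive_loss(embeddings: torch.Tensor,
                                 labels: torch.Tensor,
                                 temperature: float = 0.07) -> torch.Tensor:
    """embeddings: (B, D) video-level features; labels: (B,) 0=benign, 1=malignant."""
    z = F.normalize(embeddings, dim=1)
    sim = z @ z.t() / temperature                       # pairwise cosine similarities
    # Mask out self-similarity on the diagonal.
    logits_mask = ~torch.eye(len(z), dtype=torch.bool, device=z.device)
    # Positive pairs: same diagnosis label (intra-class relationship).
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & logits_mask
    log_prob = sim - torch.logsumexp(sim.masked_fill(~logits_mask, -1e9),
                                     dim=1, keepdim=True)
    # Average log-likelihood of positives per anchor; anchors without a
    # positive pair in the batch are excluded from the mean.
    pos_counts = pos_mask.sum(1).clamp(min=1)
    loss = -(log_prob * pos_mask).sum(1) / pos_counts
    return loss[pos_mask.sum(1) > 0].mean()
```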
Ultrasound (US) is widely used for its advantages of real-time imaging, radiation-free operation, and portability. In clinical practice, analysis and diagnosis often rely on US sequences rather than a single image to obtain dynamic anatomical information. This is challenging for novices to learn, because practicing with adequate videos from patients is clinically infeasible. In this paper, we propose a novel framework to synthesize high-fidelity US videos. Specifically, the synthesized videos are generated by animating a source content image based on the motion of a given driving video. Our highlights are three-fold. First, leveraging the advantages of self-supervised learning, our proposed system is trained in a weakly supervised manner for keypoint detection. These keypoints then provide vital information for handling complex, highly dynamic motion in US videos. Second, we decouple content and texture learning using dual decoders to effectively reduce the model's learning difficulty. Finally, we adopt an adversarial training strategy with a GAN loss to further sharpen the generated videos, narrowing the gap between real and synthesized videos. We validate our method on a large in-house pelvic dataset with high dynamic motion. Extensive evaluation metrics and a user study demonstrate the effectiveness of our proposed method.
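The adversarial training step might look like the sketch below, where `keypoint_detector`, `generator`, and `discriminator` are hypothetical placeholders for the paper's modules and the loss weight is illustrative; it shows the combination of a reconstruction term with a GAN loss, not the authors' exact objective.

```python
import torch
import torch.nn.functional as F

def training_step(source_img, driving_frame,
                  keypoint_detector, generator, discriminator):
    kp_src = keypoint_detector(source_img)        # weakly supervised keypoints
    kp_drv = keypoint_detector(driving_frame)     # motion cues from the driving video
    fake = generator(source_img, kp_src, kp_drv)  # animated frame

    # Reconstruction term: the driving frame itself serves as the target.
    rec_loss = F.l1_loss(fake, driving_frame)
    # Non-saturating GAN loss on the generator side, to sharpen the output.
    d_fake = discriminator(fake)
    gan_loss = F.binary_cross_entropy_with_logits(d_fake, torch.ones_like(d_fake))
    return rec_loss + 0.1 * gan_loss              # 0.1 is an illustrative weight
```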
Modeling the noise transition matrix is a promising approach for learning with label noise. Based on the estimated noise transition matrix and the noisy posterior probabilities, the clean posterior probabilities, jointly called the Label Distribution (LD) in this paper, can be calculated as the supervision. To reliably estimate the noise transition matrix, some methods assume that anchor points are available during training. Nonetheless, if the anchor points are invalid, the noise transition matrix may be poorly learned, resulting in poor performance. Consequently, other methods treat reliable data points extracted from the training data as pseudo anchor points. However, from a statistical point of view, the noise transition matrix can be inferred from data with noisy labels under the clean-label-domination assumption. Therefore, we aim to estimate the noise transition matrix without (pseudo) anchor points. There is evidence showing that samples are more likely to be mislabeled as similar class labels, which means the mislabeling probability is highly correlated with the inter-class correlation. Inspired by this observation, we propose an instance-specific Label Distribution Regularization (LDR), in which the instance-specific LD is estimated as the supervision, to prevent DCNNs from memorizing noisy labels. Specifically, we estimate the noisy posterior under the supervision of noisy labels and approximate the batch-level noise transition matrix by estimating the inter-class correlation matrix with neither anchor points nor pseudo anchor points. Experimental results on two synthetic noisy datasets and two real-world noisy datasets demonstrate that our LDR outperforms existing methods.
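Under our reading of the abstract, a batch-level approximation could look like the following sketch: the inter-class correlation of the batch's noisy posteriors is row-normalized into a transition-matrix proxy, which is then inverted to recover a clean label distribution. This is an assumption-laden illustration, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def label_distribution(logits: torch.Tensor) -> torch.Tensor:
    """logits: (B, C) network outputs trained under noisy labels."""
    noisy_post = F.softmax(logits, dim=1)            # noisy posterior P(noisy y | x)
    # Inter-class correlation of the batch posteriors as a transition proxy.
    corr = noisy_post.t() @ noisy_post               # (C, C)
    T = corr / corr.sum(dim=1, keepdim=True)         # row-normalize to a stochastic matrix
    # Invert the transition to approximate the clean posterior:
    # p_noisy = p_clean @ T  =>  p_clean ~= p_noisy @ T^{-1}.
    clean = (noisy_post @ torch.linalg.inv(T)).clamp(min=0)
    return clean / clean.sum(dim=1, keepdim=True).clamp(min=1e-8)
```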
With the drive to create a decentralized digital economy, Web 3.0 has become a cornerstone of digital transformation, developed on the basis of computing-force networking, distributed data storage, and blockchain. With the rapid realization of quantum devices, Web 3.0 is being developed in parallel with the deployment of quantum cloud computing and quantum Internet. In this regard, quantum computing first disrupts the original cryptographic systems that protect data security while reshaping modern cryptography with the advantages of quantum computing and communication. Therefore, in this paper, we introduce a quantum blockchain-driven Web 3.0 framework that provides information-theoretic security for decentralized data transferring and payment transactions. First, we present the framework of quantum blockchain-driven Web 3.0 with future-proof security during the transmission of data and transaction information. Next, we discuss the potential applications and challenges of implementing quantum blockchain in Web 3.0. Finally, we describe a use case for quantum non-fungible tokens (NFTs) and propose a quantum deep learning-based optimal auction for NFT trading to maximize the achievable revenue for sufficient liquidity in Web 3.0. In this way, the proposed framework can achieve proven security and sustainability for the next-generation decentralized digital society.
Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking, and many other mobile tasks. Thus, it is crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single-image depth estimation solutions that can show real-time performance on IoT platforms and smartphones. For this, the participants used a large-scale RGB-to-depth dataset that was collected with the ZED stereo camera, capable of generating depth maps for objects located at up to 50 meters. The runtime of all models was evaluated on the Raspberry Pi 4 platform, where the developed solutions were able to generate VGA-resolution depth maps at up to 27 FPS while achieving high-fidelity results. All models developed in the challenge are also compatible with any Android or Linux-based mobile device; a detailed description of each is provided in this paper.
Advanced research on connected vehicles has recently targeted the integration of vehicle-to-everything (V2X) networks with machine learning (ML) tools and distributed decision making. Federated learning (FL) is emerging as a new paradigm for training machine learning (ML) models, including for vehicles in V2X networks. Rather than sharing and uploading training data to a server, the updating of model parameters (e.g., the weights and biases of neural networks) is applied by a large population of interconnected vehicles acting as local learners. Despite these benefits, a limitation of existing approaches is their centralized optimization, which relies on a server to aggregate and fuse local parameters, introducing the drawbacks of a single point of failure and scaling problems as the V2X network size grows. Meanwhile, in intelligent transportation scenarios, the data collected from on-board sensors are redundant, which degrades the performance of aggregation. To tackle these problems, we explore the novel idea of decentralized data processing and introduce a federated learning framework for in-network vehicles, C-DFL (Consensus-based Decentralized Federated Learning), to tackle federated learning on connected vehicles and improve learning quality. Extensive simulations have been implemented to evaluate the performance of C-DFL, showing that C-DFL outperforms conventional methods in all scenarios.
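A minimal sketch of the consensus step that, as we read it, replaces server aggregation in C-DFL: after local training, each vehicle averages its parameters with its one-hop neighbors. The uniform mixing weights and the fixed neighborhood graph are our simplifying assumptions.

```python
import torch

def consensus_update(models, adjacency):
    """models: list of nn.Module, one per vehicle;
    adjacency: dict mapping vehicle index -> list of neighbor indices."""
    # Snapshot all parameters first so updates use pre-round values.
    snapshots = [{k: v.clone() for k, v in m.state_dict().items()} for m in models]
    for i, model in enumerate(models):
        neighbors = [i] + adjacency[i]              # include self in the mixing set
        mixed = {}
        for key in snapshots[i]:
            stacked = torch.stack([snapshots[j][key].float() for j in neighbors])
            mixed[key] = stacked.mean(dim=0)        # uniform consensus averaging
        model.load_state_dict(mixed)
```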
Monocular depth estimation is an important task in the computer vision community. Although tremendously successful methods have achieved excellent results, most of them are computationally expensive and not applicable for real-time inference. In this paper, we aim to address a more practical application of monocular depth estimation, where the solution should consider not only accuracy but also inference time on mobile devices. To this end, we first develop an end-to-end learning-based model with a small weight size (1.4MB) and a short inference time (27 FPS on a Raspberry Pi 4). Then, we propose a simple yet effective data augmentation strategy, called R2 crop, to boost model performance. Furthermore, we observe that a simple lightweight model trained with only a single loss term suffers from a performance bottleneck. To alleviate this issue, we adopt multiple loss terms to provide sufficient constraints during the training stage. Moreover, with a simple dynamic re-weight strategy, we can avoid time-consuming hyper-parameter selection for the loss terms. Finally, we adopt structure-aware distillation to further improve model performance. Notably, our solution ranked 2nd in the MAI&AIM 2022 Monocular Depth Estimation Challenge, with an si-RMSE of 0.311, an RMSE of 3.79, and an inference time of 37 ms, tested on a Raspberry Pi 4. Notably, we provide the fastest solution in the challenge. Code and models will be released at \url{https://github.com/zhyever/litedepth}.
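One simple dynamic re-weighting scheme consistent with this description, though not necessarily the authors' exact one, is to normalize each loss term by its own detached magnitude, so that every term contributes a comparable gradient scale without hand-tuned weights:

```python
import torch

def combine_losses(loss_terms: dict) -> torch.Tensor:
    """loss_terms: mapping from loss name to a scalar torch.Tensor."""
    total = 0.0
    for name, loss in loss_terms.items():
        scale = loss.detach().clamp(min=1e-8)   # normalize away the raw magnitude
        total = total + loss / scale            # each term contributes ~1.0 in value
    return total
```

Because the scale factor is detached, the value of each normalized term is near 1.0 while its gradient direction is preserved, which is what removes the need for per-term weight search.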
Automated abdominal multi-organ segmentation is a crucial yet challenging task in the computer-aided diagnosis of abdominal organ-related diseases. Although numerous deep learning models have achieved remarkable success in many medical image segmentation tasks, accurate segmentation of abdominal organs remains challenging, due to the varying sizes of abdominal organs and the ambiguous boundaries among them. In this paper, we propose a boundary-aware network (BA-Net) to segment abdominal organs on CT scans and MRI scans. This model contains a shared encoder, a boundary decoder, and a segmentation decoder. A multi-scale deep supervision strategy is adopted on both decoders, which can alleviate the issues caused by variable organ sizes. The boundary probability maps produced by the boundary decoder at each scale are used as attention to enhance the segmentation feature maps. We evaluated BA-Net on the Abdominal Multi-Organ Segmentation (AMOS) challenge dataset and achieved an average Dice score of 89.29% for multi-organ segmentation on CT scans and an average Dice score of 71.92% on MRI scans. The results demonstrate that BA-Net is superior to nnUNet on both segmentation tasks.
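A hedged 2D sketch of the boundary-attention mechanism follows: the boundary decoder's probability map at a given scale residually re-weights the segmentation features at the same scale. The module names and the residual form are our assumptions (the paper operates on 3D scans; a 2D convolution is used here for brevity).

```python
import torch
import torch.nn as nn

class BoundaryAttention(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        # Predicts a single-channel boundary probability map from boundary features.
        self.boundary_head = nn.Conv2d(channels, 1, kernel_size=1)

    def forward(self, seg_feat: torch.Tensor, bnd_feat: torch.Tensor):
        boundary_prob = torch.sigmoid(self.boundary_head(bnd_feat))  # (B, 1, H, W)
        # Residual attention: emphasize segmentation features near predicted boundaries.
        enhanced = seg_feat * (1.0 + boundary_prob)
        # boundary_prob would also receive multi-scale deep supervision.
        return enhanced, boundary_prob
```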
Kidney structure segmentation is a crucial yet challenging task in the computer-aided diagnosis of surgery-based renal cancer. Although numerous deep learning models have achieved remarkable success in many medical image segmentation tasks, accurate segmentation of kidney structures on computed tomography angiography (CTA) images remains challenging, due to the variable sizes of kidney tumors and the ambiguous boundaries between kidney tumors and their surroundings. In this paper, we propose a boundary-aware network (BA-Net) to segment kidneys, kidney tumors, arteries, and veins on CTA scans. This model contains a shared encoder, a boundary decoder, and a segmentation decoder. A multi-scale deep supervision strategy is adopted on both decoders, which can alleviate the issue of variable tumor sizes. The boundary probability maps produced by the boundary decoder at each scale are used as attention to enhance the segmentation feature maps. We evaluated BA-Net on the Kidney PArsing (KiPA) challenge dataset and achieved an average Dice score of 89.65% for kidney structure segmentation on CTA scans using 4-fold cross-validation. The results demonstrate the effectiveness of BA-Net.
Carotid vessel wall segmentation is a crucial yet challenging task in the computer-aided diagnosis of atherosclerosis. Although numerous deep learning models have achieved remarkable success in many medical image segmentation tasks, accurate segmentation of the carotid vessel wall on magnetic resonance (MR) images remains challenging, due to limited annotations and heterogeneous arteries. In this paper, we propose a semi-supervised label propagation framework to segment the lumen, normal vessel walls, and atherosclerotic vessel walls on 3D MR images. By interpolating the provided annotations, we obtain 3D continuous labels for training a 3D segmentation model. With the trained model, we generate pseudo labels for unlabeled slices to incorporate them into model training. Then, we use the entire MR scans and the propagated labels to retrain the segmentation model and improve its robustness. We evaluated the label propagation framework on the CarOtid vessel wall SegMentation and atherosclerOsis diagnosiS (COSMOS) challenge dataset and achieved a QuanM score of 83.41% on the test dataset, which took 1st place on the online evaluation leaderboard. The results demonstrate the effectiveness of the proposed framework.
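The pseudo-labeling step of the propagation loop could be sketched as follows, assuming per-slice softmax maps from the trained model and a confidence threshold of our own choosing; the interpolation of the annotated slices and the retraining loop are omitted for brevity.

```python
import numpy as np

def pseudo_label_slices(probs: np.ndarray, labeled_mask: np.ndarray,
                        dense_labels: np.ndarray, confidence: float = 0.9):
    """probs: (Z, C, H, W) softmax maps from the trained model;
    labeled_mask: (Z,) True for slices with (interpolated) annotations;
    dense_labels: (Z, H, W) current label volume, updated in place."""
    pseudo = probs.argmax(axis=1)              # hard pseudo-labels per voxel
    conf = probs.max(axis=1)                   # model confidence per voxel
    for z in np.where(~labeled_mask)[0]:       # only fill slices without annotations
        keep = conf[z] > confidence            # discard low-confidence voxels
        dense_labels[z][keep] = pseudo[z][keep]
    return dense_labels
```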